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I.  INTRODUCTION 


During  the  previous  quarter,  several  sets  of  sonar  sea  data  were 
quantized  and  stored  on  digital  tape.  These  data  represented  five 
different  transmission  modes  and  it  was  intended  to  use  these  data  for 
a statistical  study  of  cepstrum  analysis.  At  that  time  the  cepstrums 
that  were  produced  showed  very  poor  results  and  the  statistical  study 
was  delayed  until  a reason  could  he  found  to  explain  the  poor  results. 

It  was  speculated  that  several  reasons  might  account  for  the  poor 
results:  one  reason  might  have  been  programming  errors  and  another 
reason  might  have  been  poor  data. 

During  the  present  quarter  an  effort  has  been  made  to  explain  fr!>-  1 

' t-he-ee'-probleme . > A carefully  controlled  experiment  was  conducted  at 

the  Applied  Research  Laboratories  Lake  Travis  Test  Station,  using 
three  styrofoam  cylinders  as  a target.  The  transmission  and  return 
signals  were  recorded  on  analog  tape  and  returned  to  the  laboratory. 

The  signals  were  then  quantized  and  stored  on  digital  tape  for  later 
analysis.  The  results  of  that  analysis  are  included  in  this  report.  ^ 

In  addition,  a study  using  artificially  generated  data  was  used 
to  calculate  the  processing  gain  of  cepstrum.  This  was  compared  with 
the  processing  gain  for  a replica  correlator  and  an  autocorrelator. 

The  results  of  this  study  are  important  and  they  are  shown  at  the 
end  of  the  report. 


II.  SIGNAL  PROCESSING 


Three  styrofoam  cylinders  were  placed  under  water  in  a line  at  a 
depth  of  10  ft.  The  line  was  in  the  acoustic  "beam  and  parallel  to 
the  acoustic  axis  of  the  transmitter/receiver  system.  Two  types  of 
transmissions  were  used  to  illuminate  the  line  target.  These  were 
linear  FM  slides  where  T = 2 msec,  W = 8 kHz  and  T - 4 msec,  W = 8 kHz. 
An  illustration  of  the  processing  for  the  two  TW  products  is  shown  in 
Fig.  1.  This  shows  the  envelopes  of  the  transmission  pulse,  a return 
from  the  target,  a cepstrum  of  the  return  signal,  and  a return  from 
the  target  illuminated  by  a short  pulse.  The  transmission  pulse  was 
recorded  by  placing  a wideband  hydrophone  in  the  acoustic  beam.  It 
was  observed  that  usually  the  transmitting  crystal  was  operated  at 
resonance  (110  kHz).  The  consequence  was  that  the  transmitted  signal 
as  recorded  in  the  water  was  no  longer  a linear  FM  slide  but  had  a 
large  peak  in  the  center  of  the  slide,  plus  nulls  and  peaks  very 
near  its  center.  This  was  unacceptable  for  cepstrum  analysis  because 
of  the  required  flat  power  spectrum  in  the  transmission  pulse.  The 
center  frequency  was  shifted  to  125  kHz  and  the  result  was  an  IM  slide 
of  almost  uniform  amplitude  as  seen  in  the  figure.  For  each  type  of 
transmission,  9°  returns  were  quantized  and  stored  on  digital  tape. 

A cepstrum  was  computed  for  each  return  and  also  stored  on  digital 
tape.  In  addition  a short  pulse  (200  usee)  was  transmitted  and 
recorded  to  help  with  scaling  and  to  help  locate  the  target.  As 
shown  in  Fig.  1,  the  cepstrum  has  two  predominant  peaks.  The  first 
peak  corresponds  to  a correlation  between  the  returns  from  the  first 
and  second  cylinders  plus  the  second  and  th5rd.  The  second  peak 
corresponds  to  a correlation  between  the  returns  from  the  first  and 
third  cylinders. 
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TRANSMIT 


FIGURE  1 

SIGNAL  PROCESSING  ON  THE  RETURN  FROM  A 3 POINT  TARGET 


III.  STATISTICAL  ANALYSIS 


It  was  shown  in  Quarterly  Progress  Report  No.  1 that  the 
x,y  quadrature  components  of  the  cepstrum  can  he  averaged  to  obtain  a 
processing  gain  in  the  presence  of  a stable  target  plus  noise.  The 
envelopes  of  averaged  cepstrums  are  shown  in  Figs.  2 through  5-  Each 
envelope  is  the  average  of  three  cepstrums  and  when  they  are  compared 
with  Fig.  1,  it  is  clear  that  a processing  gain  has  been  achieved. 

It  is  also  clear  that  the  averages  are  fairly  consistent,  but  there 
are  changes  with  time.  The  changes  are  attributed  to  the  fact  that 
the  cylinders  are  moving  with  respect  to  each  other  from  ping  to  ping. 
This  means  that  if  too  many  cepstrums  are  averaged  together,  the  peaks 
in  the  cepstrum  that  indicate  the  presence  of  a target  can  be  obscured. 
A point  is  now  raised,  which  will  have  to  be  settled  later,  concerning 
the  practicality  of  averaging  the  x,y  quadrature  components  versus 
averaging  the  envelopes.  The  covariance  matrix  of  the  cepstrums  for 
the  two  TW  products  is  shown  in  Fig.  6.  In  the  quadrature  formulation 
the  covariance  matrix  is 


and  the  figures  plotted  are  actually  the  magnitudes  of  the  covariance 
matrix.  The  subscripts  i,  j refer  to  the  lag  times  of  the  cepstrum  and 


5 


PINGS  (1, 2, 3) 


the  superscript  k refers  to  the  ping  numbers.  The  main  diagonal  of 
the  matrix  corresponds  tc  the  square  of  the  average  envelope  and  it 
appears  that  in  this  case  the  average  of  the  envelopes  gives  an 
excellent  result.  That  is  to  say,  the  expected  peaks  are  large  with 
respect  to  the  surrounding  noise.  The  peaks  off  of  the  main  diagonal 
are  correlations  between  the  two  primary  peaks.  It  will  be  noted  that 
the  matrix  is  Hermitian,  which  is  to  say  that 


|Cov(t1,t^)|  = |Cov<T^,Ti)| 


(2) 


or  that  it  is  symmetrical  about  the  main  diagonal.  The  upper  left-hand 
corner  of  the  matrix  corresponds  to  the  origin  at  lag  time  0,0.  The 
lower  right-hand  corner  corresponds  to  7 msec,  7 msec.  This  is 
exactly  the  same  time  scale  shown  in  Figs.  2 through  5- 

To  gain  further  insight  into  the  statistical  nature  of  the 
cepstrum,  the  x components  were  tested  for  independence,  homogeneity, 
and  for  particular  probability  densities.  This  was  done  by  hypothesis 
testing.  For  example,  in  the  test  for  independence  it  is  assumed 
that  the  data  are  independent,  which  is  referred  to  as  the  null 
hypothesis  (Hq).  Then  a one  sample  runs  test  is  used  to  generate  a 
test  statistic  (number  of  runs)  which  has  a known  probability  distri- 


bution given  H 


If  H is  true,  then  the  test  statistic,  which  is  a 
o 


random  variable,  will  fall  below  a certain  threshold  with  a certain 
probability,  and  a decision  can  be  made  in  regard  to  the  independence 
at  a confidence  level  equal  to  the  specified  probability.  In  the 
particular  tests  to  be  shown,  a data  set  (S^)  consists  of  the  samples 


Si  = 


x*  |i  = constant,  k = 1,  2,  ...,90 


(5) 


which  would  correspond  to  samples  taken  from  a vertical  cross  section 

of  Fig.  2.  Again  it  is  mentioned  that  the  subscript  i refers  to  a 

lag  time,  t . In  the  runs  test,  consecutive  values  of  of  the  same 

sign  constitute  a run,  and  zeroes  are  ignored.  The  test  statistic  is 

the  number  of  nans  and  for  a large  number  of  samples  the  probability- 

density  of  the  test  statistic  is  a normal  distribution,  given  that 

H is  true, 
o 

By  homogeneity  one  means  that  the  data  samples  in  the  set,  S^, 
are  all  generated  by  the  same  statistical  mechanism.  The  test 
statistic  in  this  case  is  generated  by  subdividing  , obtaining  the 
cumulative  probability  distributions  of  each  subset,  and  then  finding 
the  maximum  differences  between  the  cumulative  probability  distributions. 
Given  Hq,  the  test  statistic  (maximum  difference)  is  Smirnov  distributed 
and  this  test  is  referred  to  as  the  Kolmogorov-Smimov  two  sample 
test.  In  the  particular  test  to  be  shown,  the  data  sets  were  divided 
into  halves. 

The  two  sample  test  can  be  extended  to  test  for  a theoretical 
probability  distribution.  The  difference  is  that  the  data  sample  is 
tested  against  a theoretical  distribution  instead  of  against  a subset 
of  itself.  This  is  referred  to  as  the  Kolmogorov-Smirnov  one  sample 
test. 


The  results  of  these  tests  are  shown  in  Figs.  7 and  8 for  the 
two  TW  products.  In  each  of  the  graphs  the  statistic  is  plotted 
versus  lag  time,  x . All  of  the  tests  are  independent  of  the  magni- 
tude of  the  samples  (nonparametric) . Threshold  lines  are  drawn  for 
50$  and  95$  confidence  levels.  Usually  when  the  statistics  fall 
above  the  95$  level,  the  null  hypothesis  Ho  is  rejected.  It  is  seen 
from  the  test  for  independence  that  there  is  a significant  number  of 
rejections,  especially  in  the  region  around  the  principal  peaks  in 
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FIGURE  8 

STATISTICAL  TESTS  ON  x-QUADRATURE  COMPONENT 
OF  THE  CEPSTRUMS  OF  A 3 POINT  TARGET 
TW  = 32,  W = 8 kHz 
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the  cepstrum.  This  is  not  surprising  "because  it  already  is  known 
that  the  x,y  components  can  "be  averaged  to  obtain  a processing  gain. 
However,  it  is  discouraging  to  see  many  rejections  at  lag  times 
greater  than,  say,  2.5  msec.  Perhaps  a parametric  test  would  be  more 
suitable  here  to  obtain  the  significance  of  this  result.  It  would 
be  concluded  from  this  result  that  the  data  samples  are  random  variables, 
but  they  are  not  independent.  The  results  of  the  test  for  homogeneity 
are  marginal.  It  appears  that  there  might  be  trends  in  the  data 
causing  the  data  samples  to  appear  to  be  generated  by  different  mecha- 
nisms. On  the  whole  one  would  accept  the  data  to  be  homogeneous. 

Given  that  the  data  is  homogeneous,  one  naturally  wishes  to  know  the 
type  of  probability  distribution.  The  data  were  tested  for  normal, 
Cauchy,  uniform,  exponential,  Raleigh,  and  Poisson  probability  densi- 
ties. Except  for  normality,  all  of  the  hypotheses  were  rejected. 

The  tests  for  normality  are  shown  in  the  Figs.  7 and  8. 


15 


IV.  PROCESSING  GAIN 


) 


I 


A significant  part  in  the  study  of  applying  cepstrum  to  active 
sonar  is  to  determine  its  processing  gain.  For  this  purpose  cepstrum 
is  thought  of  as  a black  box  with  a certain  signal-to-noise  ratio  on 
its  input  and  a corresponding  signal-to-noise  ratio  on  its  output. 

The  processing  gain,  G,  is  then  the  ratio  of  the  output -to-input 
ratios.  That  is 


G = 


Peak(S+N)^/N 

<S+NV*in° 


(4) 


where 

2 

Peak(S+N)Q  = peak  signal  plus  noise  out  squared, 

Nq  = mean  square  noise  out, 

(S+N)in  = mean  square  signal  plus  noise  in, 

N.  = mean  square  noise  in. 
in 

This  is  a variation  of  the  definition  given  by  Steward  and  Westerfield 
and  it  will  be  seen  that  G is  a function  of  S^/N^.  Using  artificially 
generated  data,  G was  numerically  computed  as  a function  of  S^n/N^n 
for  three  different  processors  and  Fig.  9 shows  the  result  for  a time 
bandwidth  product,  TW=l6.  It  will  be  noted  that  the  abscissa  of 
Fig.  9 is  S/N  and  not  (S+N)/N.  The  results  of  Fig.  9 can  be  explained 
in  the  following  way.  The  processor  for  the  replica  correlator  is  a 
crosscorrelation  between  a replica  of  the  transmitted  signal  and  the 
received  signal.  As  S/N  -*  »,  the  definition  given  by  Eq.  k approaches 
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PROCESSING  GAIN 


FIGURE  9 

THREE  PROCESSING  GAINS  FOR 
TW  = 16 
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the  definition  given  "by  Steward  and  Westerfield*.  In  this  case, 

G=2TW,  which  is  shown  in  the  figure.  In  the  case  of  S/N  -»  0,  it  is 
clear  that  the  noise  will  dominate  and  G will  approach  a constant 
equal  to  two. 

The  results  for  the  autocorrelation  and  the  cepstrum  require 
an  explanation  of  the  procedure  used  in  the  calculation.  The  primary 
point  of  interest  is  to  be  able  to  detect  the  time  difference  between 
two  received  replicas  of  the  transmitted  signal  superimposed  in  a 
noise  background,  as  shown  in  Fig.  10(a).  The  autocorrelation  of 
Fig.  10(a)  is  shown  in  Fig.  10(b),  and  the  cepstrum  of  Fig.  10(a)  is 
shown  in  Fig.  10(c).  The  processing  gain  is  computed  by  repeating 
the  experiment  10  times  and  averaging  to  get  the  peak  S+N  out  squared. 
The  process  is  repeated  for  noise  only  and  typical  results  are  shown 
in  Fig.  11.  The  G for  the  autocorrelation  grows  as  the  square  once 
the  peak  signal  plus  noise  out  is  greater  than  the  surrounding  mean 
square  noise  out.  In  the  case  of  the  cepstrum  the  mean  square  output 
is  independent  of  the  signal  amplitude,  and  as  S /N^n  -»  00 , the  G 
goes  to  zero.  This  is  explained  more  fully  in  the  appendix.  It 
would  appear  from  Figs.  10(b)  and  10(c)  that  the  processing  gains  for 
autocorrelation  and  cepstrum  are  the  same.  This  is  not  true  because 
the  autocorrelation  is  magnitude  dependent,  which  can  be  seen  by 
comparing  the  scales  in  Fig.  10(b)  and  Fig.  ll(a),  which  are  the 
autocorrelation  outputs  for  S+N  and  N only.  It  is  observed  that  the 
mean  square  noise  out  is  smaller  than  the  mean  square  signal  plus 
noise  out.  This  accounts  for  the  fact  that  G is  greater  for  the  auto- 
correlation than  it  is  for  the  cepstrum.  The  processing  gain  G is  also 
a function  of  the  TW  product  and  Fig.  12  shows  a family  of  G's  for 

various  TW's  and  S.  /Nj  ratios.  It  is  clear  that  the  G increases 
in'  in 

with  TW,  as  would  be  expected. 


Peak(S)^/N 

G s7n  as  S/N  - 00 
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CEPSTRUM  AUTOCORRELATION  SIGNAL  + NOISE 
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FIGURE  10 

SIGNAL  PROCESSING  ON  SIGNAL  + NOISE 
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V.  COMMENTS 

The  work  that  has  been  done  on  the  processing  gain  is  of  real 
value.  For  example,  the  cepstrums  of  the  Lake  Travis  3 point  data 
were  of  large  signals  superimposed  in  a low  noise  "background  and 
consequently  the  cepstrum  peaks  indicating  the  target  were  small.  It 
is  now  clear  that  this  is  true  because  the  processing  gain  of  cepstrum 
goes  to  zero  as  the  signal-to-noise  ratio  goes  to  infinity.  This 
work  will  be  continued  to  include  the  effects  of  averaging.  A crucial 
point  has  now  been  reached  in  the  overall  program.  If  the  effect  of 
averaging  on  cepstrum  does  not  increase  the  processing  gain  by  several 
orders  of  magnitude,  then  it  will  be  recommended  that  cepstrum  is  not 
suitable  for  a processor  of  active  sonar  return  signals.  This  will 
be  studied  in  the  future. 


in 


APPENDIX  I 

The  purpose  of  this  appendix  is  to  find  analytically  the  cepstrum 

processing  gain  for  the  extreme  limits  of  S/N.  The  processing  gain 

2 

can  he  defined  several  different  ways,  and  in  this  study  it  will  he 
defined  as 


G = 


Peak(S+N)  /N 
o'  o 

(S+N).  /N. 
in'  m 


(A-l) 


= square  of  (signal  + noise)  peak  out, 

= mean  square  noise  out, 

= mean  square  signal  + noise  input,  and 
= mean  square  noise  input. 

Cepstrum  is  a complicated  processor  and  at  the  present  time  the 

function  of  G is  not  known  analytically  as  a function  of  S.  /N  , hut 

in'  in’ 

for  the  extreme  limits  of  S.  /N.  -»  0 and  S.  /N.  -*<*>,  G can  be  found. 

in'  in  in'  in 

Let  x he  a random  independent  variable  normally  distributed 

with  mean  0 and  variance  1.  Then  y=ax  is  a scaled  random  variable 

2 

with  variance  a , In  this  case  the  y represents  a noise  input  to 
the  processor  and 


where 


Peak( S+N)q 
N 

(S«)J 

N. 

in 


M 


M 


in 


*i  M / ^ 


2 2 2 
a x.  = a 
i 


(A-2) 


i=l 


i=l 
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The  mean  square  noise  out  is  a "bit  more  complicated.  Let  F{y)  "be  the 
Fourier  transformation  of  y.  Then  the  cepstrum  of  y is 


Cep 


F|ln(|F{y)|2)  - mean! 


p p p 

•where  | F Cy}  | = a |Flx}|  . Then 


Cep  = 


F lna2  + 21n|F{x}|  - me  an  | 


(A-3) 


(A-4) 


It  is  seen  that  the  mean  square  of  Cep  will  always  he  a constant  K 

2 

because  subtracting  the  mean  will  cancel  the  effects  of  lncr  and,  on 
the  average,  F(x}  will  always  be  the  same.  That  is,  Nq=K  is  a 
constant.  In  the  limiting  case  of  small  signal  and  large  noise,  the 
processing  gain  becomes 


Lim  Est 

s/n-»o 


Peak(S+NQ)  /No 


(S+N).  /N. 

' 'in'  m 


= 2KZIL_  2 
2,2 
a /a 


(A-5) 


where  it  is  found  that 


Est 


(Ho)  - p 


2X2(N  )dN  = 2K 
o 2 o o 


(A-6) 


because  N is  a chi-square  variate  with  two  degrees  of  freedom, 
o 

For  the  case  of  large  signal  and  small  noise,  we  consider  the 
special  case  of  a received  signal  r,  consisting  of  two  superimposed 
replicas  of  the  transmitted  signal, 


r(t)  = ax(t)  + ax(t-x) 


(A-7) 
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Then  the  cep  strum  of  r is  found  to  "be 


Cep  = F|ln2a2  + In|F(x}|2  + ln(l+costoT)  - meanj 


, (A-8) 


Cep  as  |fIcoscut}|2  = l(at  lag  time  equal  t) 


(A-9) 


Again  it  is  seen  that  the  cepstrum  output  is  independent  of  the  signal 
amplitude  because  of  the  removal  of  the  mean  in  the  process.  Then 


Lim  Est(G)  = Lim 

S/N- 


M-  M-0 

2,  2 Va  /c  / 

a /a  — ' < > 


(A-10) 


It  is  concluded  that  for  small  signal  and  large  noise,  G 
approaches  2,  and  for  large  signal  and  small  noise,  G approaches  zero. 
This  has  been  shown  experimentally. 
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